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Channel-aware Decentralized Detection via 
Level- triggered Sampling 

Yasin Yilmaz^, George V. Moustakides-f, and Xiaodong Wangf 

Abstract 

We consider decentralized detection through distributed sensors that perform level-triggered sampling 
and communicate with a fusion center (FC) via noisy channels. Each sensor computes its local log- 
likelihood ratio (LLR), samples it using the level-triggered sampling mechanism, and at each sampling 
instant transmits a single bit to the FC. Upon receiving a bit from a sensor, the FC updates the global 
LLR and performs a sequential probability ratio test (SPRT) step. We derive the fusion rules under 
various types of channels. We further provide an asymptotic analysis on the average decision delay 
for the proposed channel-aware scheme, and show that the asymptotic decision delay is characterized 
by a Kullback-Leibler information number. The delay analysis facilitates the choice of the appropriate 
signaling schemes under different channel types for sending the 1-bit information from the sensors to 
the FC. 

Index Terms: Decentralized detection, level-triggered sampling, SPRT, channel-aware fusion, KL information, 
asymptotic analysis, sequential analysis. 

I. Introduction 

We consider the problem of binary decentralized detection where a number of distributed sensors, 
under bandwidth constraints, communicate with a fusion center (FC) which is responsible for making 
the final decision. In fTJ it was shown that under a fixed fusion rule, with two sensors each transmitting 
one bit information to the FC, the optimum local decision rule is a likelihood ratio test (LRT) under the 
Bayesian criterion. Later, in (2) and (5J it was shown that the optimum fusion rule at the FC is also an 
LRT under the Bayesian and the Neyman-Pearson criteria, respectively. It was further shown in (4) that 
as the number of sensors tends to infinity it is asymptotically optimal to have all sensors perform an 
identical LRT. The case where sensors observe correlated signals was also considered, e.g., (5), (6). 
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Most works on decentralized detection, including the above mentioned, treat the fixed-sample-size 
approach where each sensor collects a fixed number of samples and the FC makes its final decision 
at a fixed time. There is also a significant volume of literature that considers the sequential detection 
approach where both, the sensor local decision times and the FC global decision time are random, e.g., 
(7)-(T2). Regarding references (10)-(T2) we should mention that they use, both locally and globally, 
the sequential probability ratio test (SPRT), which is known to be optimal for i.i.d. observations in 
terms of minimizing the average sample number (decision delay) among all sequential tests satisfying 
the same error probability constraints (13). SPRT has been shown in |l4j Page 109] to asymptotically 
require, on average, four times less samples (for Gaussian signals) to reach a decision than the best 
fixed-sample-size test, for the same level of confidence. Relaxing the one-bit messaging constraint, the 



optimality of the likelihood ratio quantization is established in [ [15] ]. Data fusion (multi-bit messaging) 
is known to be much more powerful than decision fusion (one-bit messaging) (16), albeit it consumes 
higher bandwith. Moreover, the recently proposed sequential detection schemes based on level-triggered 



sampling in (11) and (12] are as powerful as data-fusion techniques, and at the same time they are as 
simple and bandwidth-efficient as decision-fusion techniques. 

Besides having noisy observations at sensors, in practice the channels between sensors and the FC 
are noisy. The conventional approach to decentralized detection ignores the latter, i.e., assumes ideal 
transmission channels, and addresses only the first source of uncertainty, e.g., (T), (TT) . Adopting the 
conventional approach to the noisy channel case yields a two-step solution. First, a communication 
block is employed at the FC to recover the transmitted information bits from sensors, and then a signal 
processing block applies a fusion rule to the recovered bits to make a final decision. Such an independent 
block structure causes performance loss due to the data processing inequality (TT) . To obtain the optimum 



performance the FC should process the received signal in a channel-aware manner [[18] ], (19] ]. Most works 
assume parallel channels between sensors and the FC, e.g., (20) , (21). Other topologies such as serial 
(22) and multiple-access channels (MAC) [23 ] have also been considered. In (24) a scheme is proposed 
that adaptively switches between serial and parallel topologies. 

In this paper, we design and analyze channel-aware sequential decentralized detection schemes based 
on level-triggered sampling, under different types of discrete and continuous noisy channels. In particular, 
we first derive channel-aware sequential detection schemes based on level-triggered sampling. We then 
present an information theoretic framework to analyze the decision delay performance of the proposed 
schemes based on which we provide an asymptotic analysis on the decision delays under various types 
of channels. Based on the expressions of the asymptotic decision delays, we also consider appropriate 
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signaling schemes under different continuous channels to minimize the asymptotic delays. 

The remainder of the paper is organized as follows. In Section [TTJ we describe the general structure 
of the decentralized detection approach based on level-triggered sampling with noisy channels between 
sensors and the FC. In Section [Tn| we derive channel-aware fusion rules at the FC for various types 
of channels. Next, we provide analyses on the decision delay performance for ideal channel and noisy 
channels in Section |IV| and Section |V| respectively. In Section |Vl| we discuss the issue of unreliable 
detection of the sensor sampling times by the FC. Simulation results are provided in Section [VIl| Finally, 



Section VIII concludes the paper. 



II. System Descriptions 

Consider a wireless sensor network consisting of K sensors each of which observes a Nyquist-rate 
sampled discrete-time signal {y^t E N},fc = 1, Each sensor k computes the log-likelihood 

ratio (LLR) {L^t E N} of the signal it observes, samples the LLR sequence using the level-triggered 
sampling, and then sends the LLR samples to the fusion center (FC). The FC then combines the local 
LLR information from all sensors, and decides between two hypotheses, Ho and Hi, in a sequential 
manner. 

Observations collected at the same sensor, {y^}t, are assumed to be i.i.d., and in addition observations 
collected at different sensors, {yt}k, are assumed to be independent. Hence, the local LLR at the fc-th 
sensor, L\, and the global LLR, L t , are computed as 

^±lo g f^--f =Lt 1 + l* = ±t and = (1) 

respectively, where l\ = log is the LLR of the sample y\ received at the k-th sensor at time t\ 

fi, i = 0, 1, is the probability density function (pdf) of the received signal by the k-th sensor under 
H^. The k-th sensor samples L\ via the level-triggered sampling at a sequence of random sampling 
times {t k } n that are dictated by L\ itself. Specifically, the n-th sample is taken from L\ whenever the 
accumulated LLR L\ — L k k , since the last sampling time t k n _\ exceeds a constant A in absolute value, 

^ n — 1 

i.e., 

t ^ inf [t > t_ x : L k t - L\_^ £ (-A, A)} , t\ = 0, L k = 0. (2) 
Let denote the accumulated LLR during the n-th inter-sampling interval, (t^_ l5 t^], i.e., 

x k n = J2 z " = 4-4-r ^ 
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Fig. 1. A wireless sensor network with K sensors Si, ... , Sk, and a fusion center (FC). Sensors process their observations 
{yt}, and transmits information bits {b^}. Then, the FC, receiving {z^} through wireless channels, makes a detection decision 
5f-. Ii(t), Ii(t), Ii(t) are the observed, transmitted and received information entities respectively, which will be defined in 
Section Hvl 



Immediately after sampling at t\, as shown in Fig. [TJ an information bit indicating the threshold 
crossed by A^ is transmitted to the FC, i.e., 

b k n 4 sign(A^). (4) 

Note that each sensor, in fact, implements a local SPRT [cf. ([8]), (|9j)], with thresholds A and —A 
within each sampling interval. At sensor k the n-th local SPRT starts at time t\_\ and ends at time 
when the local test statistic A^ exceeds either A or —A. This local hypothesis testing produces a local 
decision represented by the information bit 6^, and induces local error probabilities and which are 
given by 

«t = Po(^ = l), and p k ±P 1 (b* = -l) (5) 

respectively, where Pi(-), i = 0, 1, denotes the probability under H^. 

Let us now analyze the signals at the FC. Denote the received signal at the FC corresponding to as 
z\ [cf. Fig. [TJ. The FC then computes the LLR A^ of each received signal and approximates the global 
LLR L t as 

K N t k k k 

£. = EE^ with ~ X n = ^^y ( 6 ) 

k=l n=l 1 u v a/ 

where iV^ is the total number of LLR messages the fc-th sensor has transmitted up to time t, and 
Pi(')i i = 0, 1, is the pdf of under H^. In fact, the FC recursively updates L t whenever it receives an 
LLR message from any sensor. In particular, suppose that the m-th LLR message A m from any sensor 
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is received at time t m . Then at £ m , the FC first updates the global LLR as 

Lt m = Lt^ + A m . (7) 

It then performs an SPRT step by comparing L tm with two thresholds A and —B, and applying the 
following decision rule 

' H l5 if Ltm >A 

Ho, if L tm <-B, (8) 

continue to receive LLR messages, if L tm E (—5, A). 

The thresholds (A, B > 0) are selected to satisfy the error probability constraints Po(Sf = Hi) < a and 
Pi(Sf = Hq) < /3 with equalities, where a, (3 are target error probability bounds, and 

t = mi{t>0:L t g(-B,A)} (9) 

is the decision delay. 

With ideal channels between sensors and the FC, we have z k = b*, so from ^ we can write the local 
LLR A^ = A*, where 

{loo- Pl (^=i) — loo- 1-/3fc > A if b k — 1 

g Po(6}L=l) i0g a fc - ^' 11 °n - ^ ^ 

loo- Pl ( 6 ^ = ~ 1 ) — log- ^ fc < — A if h k — -1 
g P (&J = -1) ~~ S l-CKfc - ' ^ ~~ 

where the inequalities can be easily obtained by applying a change of measure. For example, to show 
the first one, we have ak = Po(A^ > A) = Eo[1{aj>a}] where E^[ ] is the expectation under H^, i = 0, 1 

and tr\ is the indicator function. Noting that e A - = fk( ^r, we can write 

x s Ti \y+k , n T-^y+k ) 



Ei[ e -^1 {AJ ,> A} ] < e- A E![l {A ,> A} ] = e" A Pi(A* > A) = e~ A (l - fa). 



Note that for the case of continuous-time and continuous-path observations at sensors, the inequalities 
in ( [TO] ) become equalities as the local LLR sampled at a sensor [cf. ([I])] is now a continuous-time and 
continuous-path process. This suggests that the accumulated LLR during any inter- sampling interval [cf. 
([3])] due to continuity of its paths will hit exactly the local thresholds ±A. Therefore, from Wald's 
analysis for SPRT = fa = ^a^x 1251; hence a transmitted bit fully represents the LLR accumulated 
in the corresponding inter-sampling interval. Accordingly, the FC at sampling times exactly recovers the 



values of LLR processes observed by sensors (TTJ. 

When sensors observe discrete-time signals, due to randomly over(under) shooting the local thresholds, 



X k in ([3]) is a random variable which is in absolute value greater than A. However, A^ in ( 10) is a fixed 
value that is also greater than A in absolute value. While in continuous-time the FC fully recovers the 
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LLR accumulated in an inter-sampling interval by using only the received bit, in discrete-time this is 
not possible. In order to ameliorate this problem, in fTT) it is assumed that the local error probabilities 



{oik, /3k} are available to the FC; and therefore the LLR of z%, i.e., A^, can be obtained; while in 1 12| the 
overshoot is quantized by using extra bits in addition to b^. Nevertheless, neither method enables the FC 
to fully recover A^ unless an infinite number of bits is used. In this paper, to simplify the performance 
analysis, we will assume, as in (TTJ, that the local error probabilities a^, k = 1, . . . , K are available 
at the FC in order to compute the LLR A^ of the received signals. Moreover, for the case of ideal 
channels, we use the A and — B to denote the thresholds in i.e., A = A, B = B, and use T to denote 
the decision delay in i.e., T = T. 

In the case of noisy channels, the received signal z\ is not always identical to the transmitted bit 6^, 



and thus the LLR A* of z\ can be different from A^ of given in dlOb. In the next section, we consider 



some popular channel models and give the corresponding expressions for A^. 

III. Channel-aware Fusion Rules 

In computing the LLR A^ of the received signal z%, we will make use of the local sensor error 
probabilities a^,/?/-, and the channel parameters that characterize the statistical property of the channel. 
One subtle issue is that since the sensors asynchronously sample and transmit the local LLR, in the 
presence of noisy channels, the FC needs to first reliably detect the sampling time in order to update the 
global LLR. In this section we assume that the sampling time is reliably detected and focus on deriving 



the fusion rule at the FC. In Section VI we will discuss the issue of sampling time detection. 



A. Binary Erasure Channels (BEC) 

Consider binary erasure channels between sensors and the FC with erasure probabilities e&, k = 
1, . . . , K. Under BEC, a transmitted bit b^ is lost with probability e^, and correctly received at the FC, 
i.e., z\ = b\, with probability 1 — e*.. Then the LLR of z% is given by 

f i g p44=ii = log i=& if 4 = 1, 

U I lo^ Pl (^ = ~ 1 ) — log- ^ k if 7 k — —1 

Note that under BEC the channel parameter is not needed when computing the LLR A^. Note also 
that in this case, a received bit bears the same amount of LLR information as in the ideal channel case, 
although a transmitted bit is not always received. Hence, the channel-aware approach coincides with the 



conventional approach which relies solely on the received signal. Although the LLR updates in (10) and 



( [TT] ) are identical, the fusion rules under BEC and ideal channels are not. This is because the thresholds 
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A and — B of BEC, due to the information loss, are in general different from the thresholds A and — B 
of the ideal channel case. 

B. Binary Symmetric Channels (BSC) 

Next, we consider binary symmetric channels with crossover probabilities e k between sensors and the 
FC. Under BSC, the transmitted bit b\ is flipped, i.e., z k = —6^, with probability e&, and it is correctly 
received, i.e., z k = b k , with probability 1 — e k . The LLR of z k can be computed as 

V>(* - n - In. = l \ h n = l ^ b n = l ) + = ^ = "l)Pl(&£ = 



P (4 = = l)Po(6* = 1) + Po(4 = = -l)Pc 



= lQg (l-e k )(l-p k ) + e k p k = l-[(l-2e fe )/3 fe + 6 fe ] ^ 
(1 - e/e)oi fc + efc(l - Oi/e) (1 - 2efc)«fc + e fe 

N v ' 

OLk 

where a k and are the effective local error probabilities at the FC under BSC. Similarly we can write 

A*(^ = -l) = lo gT A_. (13) 

Note that a k > a k , $ k > /3 k if a& < 0.5, /3 k < 0.5, Vfe, which we assume true for A > 0. Thus, we 
have | BSC \ < |A^ BEC \ fr° m which we expect the performance loss under BSC to be higher than the 



one under BEC. The numerical results provided in Section |V-B will illustrate this claim. Finally, note 



also that, unlike the BEC case, under BSC the FC needs to know the channel parameters {e k } to operate 
in a channel-aware manner. 

C Additive White Gaussian Noise (AWGN) Channels 

Now, assume that the channel between each sensor and the FC is an AWGN channel. The received 
signal at the FC is given by 

z k n = h k n x k n + w k n (14) 

where h k = h k ,\/k, n, is a known constant complex channel gain; w k ~ A/" c (0, o^); x k is the transmitted 
signal at sampling time t k given by 



a, if \ k n > A, 

(15) 

b, if\ k n <-A. 



where the transmission levels a and b are complex in general. 
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The distribution of the received signal is then z k ~ A/" c (%a; re , <r|). The LLR of z\ is given by 

.Pk{Zn\ x n = a ) P l( x n = a ) + Pk{z n \x k n = b)?i(x k n = b) 



>t =loe 



log 



Vk{A\x k n = a)P (x* = a ) +Pk(Zn\ x n = b ) P o( x n = b ) 

(1 - /3 k ) exp(-c n ) + p h exp(-d n ) 



a k exp(-c n ) + (1 - a k ) exp(-c? n ) : 
where cl 4 and d fc A M^!. 



(16) 



D. Rayleigh Fading Channels 

If a Rayleigh fading channel is assumed between each sensor and the FC, the received signal model 



is also given by (14)-(15), but with h k n ~ AA c (0, a^). We then have z\ ~ AA c (0, \x^a\ k + cr|); and 
accordingly, similar to (16), A^ is written as 



A^log 



exp(-c£) + ^exp(- 

a,fc b,k 



&k 



■exp(-c*) + ^exp(-d*) 



(17) 



6 . A- 
fe 12 



where < fc 4 | a | 2(J 2 fc + ^ ^ A ^ + ^ c * A and ^ A J^. 

£. Rician Fading Channels 

For Rician fading channels, we have ~ Af c (Hk, °\ &) ^ n an< ^ hence 2^ ~ Af c (^k x ni \ x n\ 2<J h k + 
cr|). Using cr^ fe and cr^ as defined in the Rayleigh fading case, and defining c k = ^"f^l , d k = 

' ' ., & a ,k 

\ z *~? kb \ 2 we can write A^ as in (17). 



IV. Performance Analysis for Ideal Channels 

In this section, we first find the non-asymptotical expression for the average decision delay Ej[T], and 
then provide an asymptotic analysis on it as the error probability bounds a, 0. Before proceeding to 
the analysis, let us define some information entities which will be used throughout this and next sections. 



A. Information Entities 

Note that the expectation of an LLR corresponds to a Kullback-Leibler (KL) information entity. For 
instance, 



Jf(t) 4 Ex 



log 



f k (y k ,...,y k ) 



= Ei[L*], and 7 fc (t)^E 



log 



= -E [L k ] (18) 
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are the KL divergences of the local LLR sequence {L^ } t under Hi and Hq, respectively. Similarly 



/f (t) ^ El 



if (i) 4 El 



log 



log 



pg(6f, . . . ,6* 



iV, fc 



p\(z\,...,z 



E^] , /*(t) ^ -Eo[£f] 



E![£f] , i*(t) 4 -E [L* 



(19) 



"0 V 1 ' ' ' ' ' ^AT t fc / 

are the KL divergences of the local LLR sequences {L^ } t and {L^ } t respectively. Define also Ii(t) = 
- ELi and - Ek=Ji(t) as the KL divergences of the global LLR 

sequences and {L^} respectively. 

In particular, we have 



= Ei 



log 



/o fc (i/f) 



= Ei[Zf], and = E 



log 



fHvi) 



= -EoPf] 



(20) 



as the KL information numbers of the LLR sequence {If}; and = Ylk=i * = 0, 1 are those 

of the global LLR sequence {It}. Moreover, 



k(4.k\ 
1) 



log 



fi(Vi,---,V%) 

fo(yt--^y^) 



=Ei[Af], Jf(t^ 



and /f (tf) 



Po(*i). 



log 



pg(6j) 



Ei[Af], 



(21) 



Ei[Af] 



are the KL information numbers of the local LLR sequences {A^}, {A^}, and {A^}, respectively, under 



Hi. Likewise, we have /q(^ 



-E r 



k(+k\ 



-Er 



, and I%(t\) = -E [A*] under H . To 



summarize, l\ (t), if (£), and if (t) are respectively the observed (at sensor fc), transmitted (by sensor k), 
and received (by the FC) KL information entities as illustrated in Fig. [T] 
Next we define the following information ratios, 



*k A Ij (t\) i ~fc a Ijiti) 



(22) 



which represent how efficiently information is transmitted from sensor k and received by the FC, 
respectively. Due to the data processing inequality, we have < fjf , fjf < 1, for i = 0, 1 and k = 1, . . . , K. 
We further define 



K 



K 



K 



1,(1) 4 £tf J*(l) = £ #(1), and 7,(1) 4 £#J*(1) = £ J*(l) (23) 

/c=l k=l k=l k=l 

as the effective transmitted and received values corresponding to the KL information 7^(1), respectively. 
Note that 7^(1) and 7^(1) are not real KL information numbers, but projections of 7^(1) onto the filtrations 
generated by the transmitted, (i.e., {b^}), and received, (i.e., {^}), signal sequences, respectively. This is 
because sensors do not transmit and the FC does not receive the LLR of a single observation, but instead 
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they transmit and it receives the LLR messages of several observations. Hence, we cannot have the KL 
information for single observations at the two ends of the communication channel, but we can define 
hypothetical KL information to serve analysis purposes. In fact, the hypothetical information numbers 



seen in the following sections. 

The KL information if(l) of a sensor whose information ratio, rjf, is high and close to 1 is well 
projected to the FC. Conversely, if (1) of a sensor which undergoes high information loss is poorly 
projected to the FC. Note that there are two sources of information loss for sensors, namely, the overshoot 
effect due to having discrete-time observations and noisy transmission channels. The latter appears only 
in fjf, whereas the former appears in both fjf and fjf. In general with discrete-time observations at sensors 
we have 7^(1) ^ 7^(1) and 1^(1) ^ 7^(1). Lastly, note that under ideal channels, since z\ = b^, Vfc, n, we 
have Iiil) = iiil). 

B. Asymptotic Analysis of Detection Delay 

Let {t% : t% = — denote the inter-arrival times of the LLR messages transmitted from the k-th 
sensor. Note that t% depends on the observations y* k +1 , . . . , y^ k , and since {yf} are i.i.d., {r^} are also 
i.i.d. random variables. Hence, the counting process {Nj*} is a renewal process. Similarly the LLRs {A^} 
of the received signals at the FC are also i.i.d. random variables, and form a renewal-reward process. Note 
from ^ that the SPRT can stop in between two arrival times of sensor k, e.g., <T < t^ +1 . The event 
Nj- = n occurs if and only if t\ = r\ + . . . + t% < T and = r\ + . . . + r^ +1 > T, so it depends 
on the first (n + 1) LLR messages. From the definition of stopping time (26} pp. 104] we conclude that 
Nj- is not a stopping time for the processes {t%} and {A^} since it depends on the (n + l)-th message. 
However, Nj- + 1 is a stopping time for {r^} and {A^} since we have Nj- + 1 = n <^> Nj- = n — 1 
which depends only on the first n LLR messages. Hence, from Wald's identity (26} pp. 105] we can 
directly write the following equalities 



and Iiil), defined using the information ratios t)\ 



)f and fjf, are crucial for our analysis as will be 




(24) 



n=l 



and 




(25) 



n=l 



We have the following theorem on the average decision delay under ideal channels. 
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Theorem 1. Consider the decentralized detection scheme given in Section^ with ideal channels between 



sensors and the FC. Its average decision delay under is given by 



I / J — ± . . + 



(26) 

4(1) 4(1) 

where 34 w a random variable representing the time interval between the stopping time and the arrival 



of the first bit from the k-th sensor after the stopping time, i.e., = t k 



T. 



Proof: From ( |24| ) and ([25]) we obtain 

~iV*L+l 
n=l 



Ei[rf] 



Ei[Af] 



where the left-hand side equals to Ei[T\ + Ei[34]- Note that E^r-f] is the expected stopping time of 
the local SPRT at the k-th sensor and by Wald's identity it is given by E^rf] = ^ ^ , provided that 
Ei[/f] ^ 0. Hence, we have 



E 



n=l A n 



Ei[Af] Ei[lf] 



i k (t*)iKT) + iH tk N* + i) 



where we used the fact that Ei 



Ei[L*.] + E![A* J 



E 



n=l A n 



JV fc +1 j and similarly 

= — Iq(T) — ^(t^ ). Note that Ej[-] is the expectation with respect to A^ fc . and 



JVf under H<. By rearranging the terms and then summing over k on both sides, we obtain 

k=i 'iVi) k=i ^ l \ ^hl 



which is equivalent to (26) 



The result in (26) is in fact very intuitive. Recall that h(T) is the KL information at the detection 



time at the FC. It naturally lacks some local information that has been accumulated at sensors, but has 
not been transmitted to the FC, i.e., the information gathered at sensors after their last sampling times. 
The numerator of the second term on the right hand side of ([26]) replaces such missing information by 



using the hypothetical KL information. Note that in (26) Ii{t k Nk , x ) ^ li^i)* i,e -' Ei[A^ fc , J ^ E^A^], 



since Nj- and A^ fc+1 are not independent. 

The next result gives the asymptotic decision delay performance under ideal channels. 



Theorem 2. As the error probability bounds tend to zero, i.e., a, (3 — » 0, the average decision delay 
under ideal channels given by ( [26] ) satisfies 

| log 0| 



Ei[n = ^r + 0(1), and E [T] 
M 1 ) 



4(1) 



+ 0(1), 



(27) 
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where O(l) represents a constant term. 



Proof: We will prove the first equality in ( |27| ), and the proof of the second one follows similarly. 
Let us first prove the following lemma. 

Lemma 1. As a, (3 — >> we teve the following KL information at the FC 

I^T) = |loga| + 0(l), owd 7 (T) = |log/3| + O(l). (28) 
Proof: We will show the first equality and the second one follows similarly. We have 
I^T) =Pi(L T > A)Ei[L r |L T > A] + Pi(L r < -B)E 1 [L r \L T < -B] 

= (1 - /3)(A + Ei[0 A ]) - 13(B + E^b}) (29) 

where 0a,0b are overshoot and undershoot respectively given by 6a — Lj- — A if L7- > A and 0g = 
— Lj- — B if L7- < —5. From fill Theorem 2], we have A < | log a\ and £> < | log so as a, /3 — )► 



(29) becomes ii(T) = A + E 1 [6a\ +o{1). From (10) we have |A*| < 00 if < a k , f3 k < 1. If we assume 



< A < 00 and < oo,Vfc,t, then we have < a^,/3/c < 1 and as a result Ii{t\) = E^A^] < 00. 
Since the overshoot cannot exceed the last received LLR value, we have 9 a, Ob < ® = max^ n |A^| < 00. 
Similar to Eq. (73) in JTT| we can write (3 > e~ B ~ e and a > e~ A ~ e where 6 = O(l) by the above 
argument, or equivalently, B > | log /3| — O(l) and A > | \oga\— O(l). Hence we have A = |loga|+0(l) 
and B = |log/3| + 0(1). ■ 
From the assumption of < oo,Vfc,t, we also have 7^(1) < 7^(1) < 00. Moreover, we have 



Ei[34] < ^[r-f] < oc since E^i] ^ 0. Note that all the terms on the right-hand side of (26) except 



for Ii(T) do not depend on the global error probabilities a,/3, so they are O(l) as — )► 0. Finally, 



substituting d28J) into d26b we get (27). 



It is seen from (27) that the hypothetical KL information number, 7^(1), plays a key role in the 
asymptotic decision delay expression. In particular, we need to maximize 7^(1) to asymptotically minimize 
Ei[T}. Recalling its definition 

we see that three information numbers are required to compute it. Note that if (1) = [Zf ] and Ii(t\) = 



Ei[Aj], which is given in (30) below, are computed based on local observations at sensors, thus do not 
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(30) 



depend on the channels between sensors and the FC. Specifically, we have 

/?(«}) = (1 - &)(A + E 1 [e k n }) - p k (A + E^]), 

and I^tl) = a k (A + E [^]) - (1 - a fc )(A + E [^]) 
where 9^ and are local over(under) shoots given by 9^ = A^ — A if A^ > A and 9^ = — A^ — A if 
A^ < -A. Due to having \l$\ < oo,Vfc,t we have 0^,0* < oo,V/c,n. 

On the other hand, if (t\) represents the information received in an LLR message by the FC, so it 
heavily depends on the channel type. In the ideal channel case, from ([TO]) it is given by 



and i%{t\) 



(l-/3 fc )log 



+ A log 



«fc log 



OLk 



+ (1 - Oft) log 



1 - OLk 



(31) 



OLk 



Since if (tj) is the only channel-dependent term in the asymptotic decision delay expression, in the next 



section we will obtain its expression for each noisy channel type considered in Section III 



V. Performance Analysis for Noisy Channels 

In all noisy channel types that we consider in this paper, we assume that channel parameters are either 
constants or i.i.d. random variables across time. In other words, hk are constant for all k (see Section 



III-A 



III-B 



III-C), and {hn} n ,{Wn} n are i.i.d. for all k (see Section 



III-C 



III-D 



III-E| ). Thus, in all 

noisy channel cases discussed in Section III the inter-arrival times of the LLR messages {r^}, and the 



LLRs of the received signals {A^} are i.i.d. across time as in the ideal channel case. Accordingly the 



average decision delay in these noisy channels has the same expression as ([26]), as given by the following 
proposition. The proof is similar to that of Theorem [T] 



Proposition 1. Under each type of noisy channel discussed in Section \HI\ the average decision delay is 
given by 

Ei[f] _i 1 (f) , zLim H d-£iWHi) (32) 



hiX) 



hiX) 



where 34 = ~ T. 



The asymptotic performances under noisy channels can also be analyzed analogously to the ideal 
channel case. 



Proposition 2. As a, (3 0, the average decision delay under noisy channels given by (32) satisfies 



Ei [71 



I loga| 



+ 0(1), and E [f] 



|log/3| 



+ 0(1). 



(33) 
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Proof: Note that in the noisy channel cases the FC, as discussed in Section [TTTj, computes the LLR, 
A^, of the signal it receives, and then performs SPRT using the LLR sum L t . Hence, analogous to Lemma 
we can show that h(T) = \ loga\ + O(l) and J (T) = | log /3| + O(l) as a,/3 0. Note also that 
due to channel uncertainties |A^| < |A^|, so we have I*(t\) < I*(t\) < oc and £(1) < ^(1) < oc. We 
also have E^[34] < ^[r-f] < oc as in the ideal channel case. Substituting these asymptotic values in (32) 
we get ([33]). ■ 
Recall that /*(!) = T^fj 7 ^ 1 ) in (33) where T K l ) and T i (*i) are independent of the channel 



type, i.e., they are same as in the ideal channel case. In the subsequent subsections, we will compute 
Ii(t\) for each noisy channel type. We will also consider the choices of the signaling levels a, b in (15) 
that maximize l\ (t\). 



A. BEC 

Under BEC, from ( fTTj ) we can write the LLR of the received bits at the FC as 

A^, with probability 1 — e&, 
0, with probability e^. 
Hence we have 

^(tf) = E i [Af] = (l-e*)J/ s («f) 



A 



(34) 



(35) 



where l\{t\) is given in (31). As can be seen in (35) the performance degradation under BEC is only 
determined by the channel parameters e^. In general, from ((27]), ((33]) and ( [35] ) this asymptotic performance 
loss can be quantified as 1 _ m } nfc Ck < f^jfj < i- m ax fc e k • Specifically, if = e,Vfc, then we have 

Eiin 



^3 - -J- as a, /3 -»■ 0. 



l-e 



B. BSC 



Recall from ( fT2| ) and ( [T3] ) that under BSC local error probabilities a^.fa undergo a linear transfor- 



mation to yield the effective local error probabilities otk,Pk at the FC. Therefore, using ([12]) and ((13]), 
similar to (31), lf{t\) is written as follows 

1 Pk . n I fa 

— h/3 fc log- — , 

rt/ 7 I — r\ 7 

(36) 



J?(tJ) = (!-&) log- 



and 7 fc (^) = c^log 



1 



+ (1 - a k ) log 



1 - «ifc 



«fc 1 - &k 

where ce^ = (1 — 2e/c)a^ + e/e and /3 k = (1 — 2e&)/3/ c + e^. Notice that the performance loss in this case 
also depends only on the channel parameter e&. 
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Fig. 2. The KL information, /f (tf), under BEC and BSC, as a function of the local error probabilities ctk — Pk and the 
channel error probability €k- 



In Fig. 2 we plot Ii(t\) as a function of a& = fik an d for both BEC and BSC. It is seen that the 
KL information of BEC is higher than that of BSC, implying that the asymptotic average decision delay 



is lower for BEC, as anticipated in Section III-B 



C. AWGN 

In this and the following sections, we will drop the sensor index k of a\ k and a\ for simplicity. In 

that if the transmitted signal is a, i.e., x\ = a, then = 



III-C 



the AWGN case, it follows from Section 

u,d k n = v a ;md if x k n = b, then c* = v b , d k n = u where nif,»^ \<+(a-b)h k ? > Vf> a R+(ft-a)M 2 



Accordingly, from (16) we write the KL information as 



=Ei[A?] = (l-&)E 



log 



= (!-&) log- 



+ Pk log 



(1 - fo)e-" + 
"fce - " + (1 - a/c)e-' ! '° 



log 



(1 - h)e~^ + /3 k e- 
+ (1 - a fc )e- 



+ 



Jf(tf) 



log 



1 + 



1-/3. 



1 + i^e""^ 



at 



+ E 



log 



1 + ^e u ~ Vb 

Pk 

1 + T °b-e u - v '> 

l-CKfc 



)■ 



(37) 



where E[ ] denotes the expectation with respect to the channel noise w% only, and Ei[-] denotes the 
expectation with respect to both x\ and w\ under Hi. Since w\ is independent of x\ under both Hq and 
Hi, we used the identity Ei[ ] = E[Ei[-]] in ([37]). 
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Note from El\ that we have if (t\) = jf (t\) + $ k C\ and I%(t\) = I%(t\) + a fe C§. Similar to C\ we 



have Cq = -£i - ^^£2- Since we know I^{t\) < V? (t\), the extra terms, C\ , Cg < are penalty terms 
that correspond to the information loss due to the channel noise. Our focus will be on this term as we 
want to optimize the performance under AWGN channels by choosing the transmission signal levels a 
and b that maximize C\. 

Let us first consider the random variables ( a = u — v a and = u — which are the arguments 
of the exponential functions in £\ and £2 in ( [37] ) . From the definitions of u and v a , we write ( a = 

\w^_ _ \ w *+{a-b)h h \* = _ |a-y fc | a _ _2_ 7 where 7 a »{(^)*( a - b)h k } and ^{-} denotes the real part 
of a complex number. Similarly we have ( b = _ l Q - b H^I 2 + ^ 7> Note that 7 - A/*(0, |a ~ 6|2 ^ fc|V2 ) since 

~ -A/"c(0, a 2 ). If we define z/ = | Q _^ lfc | cr 7, then we have z/ ~ AA(0, 1). Upon defining s 
we can then write ( a and (^5 as 

( a —s 2 — V2 sv and —s 2 + y/2 sv. 



A |a-&||fcfc| 

(J 



If we define F = 1=^ a nd G = then we have 



Cf =E 



log 
log 



1 + F~ l e^ h 
1 + Ge& 

1 + Ge^ 
1 + F~ l e^ 



+ F _1 E 



log 



log 



1 + Fe^ 
1 + G- 1 ^ 

1 + G -1 e^ 
1 + FeCa 



(38) 



Note from (14) that the received signal, z%, will have the same variance, but different means, ah k and 
bh k , if x^ = a and x^ = b are transmitted respectively. Hence, we expect that the detection performance 
under AWGN channels will improve if the difference between the transmission levels, \a — b\, increases. 
Toward that end the following result gives a sufficient condition under which the penalty term C\ increases 
with s, and hence with \a — b\. The proof is given in the Appendix. 

Lemma 2. C\ is an increasing function of s, i = 0, 1, if F 2 > G and G 2 > F. 

Lemma [2] indicates that for a k: f3 k values inside the region shown in Fig. [3} Cf is increasing in \a — b\. 
Note that a k , /3 k are l° ca l error probabilities which are directly related to the local threshold A. Therefore, 
even if the hypotheses Ho and Hi are non-symmetric, we can ensure that we will have a kl f3 k inside the 
region in Fig. [5] by employing different local thresholds, — A k and A k , in In fact, even for a k ,/3 k 
values outside the region in Fig. [3] numerical results show that C\ is increasing in s. 

Hence, maximizing Cf is equivalent to maximizing \a — b\. If we consider a constraint on the maximum 
allowed transmission power at sensors, i.e., max(|a| 2 , |6| 2 ) < P 2 , then the antipodal signaling is optimum, 
i.e., \a\ = \b\ = P and a —b. 
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D. Rayleigh Fading 
It follows from Section 



III-D 



that c* 



k 12 



u a when = a; and 



when 



x k n = b where u a 4 I°*Sd^, ^ = JMi^, and a 2 = | a |2 a 2 + a 2 ? a 2 = | 6 |2 a 2 + a 2 as defined in 
Define further p = . Hence, using jnj) we write the KL information as 



Section 



Jf(tf) =(1-A)E 



III-D 



log 



I ^2 « 



^|g-Wa -\- 1 ~g : fc e -pU a 



= (l-A)log 



1 



log 



log 



1— fffc ^— /Q i fh„—u b 



1 + Gp~ V b 
1 + F-V _1 e Cb 



log 



1 + G~ l pe^ 
1 + FpeCa 



/f(tf) 



(39) 



where £ a = u a (l — p) and = u^l — p x ). Note that when |a| = |6| which corresponds to the optimal 
signaling in the AWGN case, we have p = 1, ( a = = and therefore Ii(t\) = in (39). This result is 



quite intuitive since in the Rayleigh fading case the received signals differ only in their variances. Note 
that u a and are chi-squared random variables with 2 degrees of freedom, i.e., u^u^ ~ xh ^us we 
can write the penalty term as 



log 



1 + F- X p 



-lMi-p- 1 ) 



1 + Gp- 1 e»( 1 -/'- 1 ) 



+ F~ l lo. 



and Cf 



1 °6 1 + F -i p -i e u(i-p- 1 ) + gto 6 



1 + Fpe< x -d \ e~ 

1 + G-y (1_ ' ,) / ~2 

1 + G" V u(1 ~ p) \ 
1 + Fpe u ^-P) J 



-du, 



(40) 



Note that given local error probabilities a^,/^ ine integrals in (40) is a function of p only. However, 
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maximizing Cf in (40) with respect to p seems analytically intractable. As can be seen in Section III-D the 
received signals at the FC will have zero mean and the variances a 2 and a 2 when x\ = a and x\ = b re- 
spectively. Therefore, in this case intuitively we should increase the difference between the two variances, 
i.e., ||a| 2 - |6| 2 |. Consider the following constraints: max(|a| 2 , \b\ 2 ) < P 2 and min(|a| 2 , |6| 2 ) > Q 2 , 
where the first one is the peak power constraint as before, and the second is to ensure reliable detection 
of an incoming signal by the FC. We conjecture that the optimum signaling scheme in this case that 
maximizes C\ corresponds to \a\ = P, \b\ = Q or \a\ = Q, \b\ = P. 

To numerically illustrate the behavior of C\ as a function of p, we set = /3k = 0.1, a\ = a 2 = 1, 
P 2 = 10, Q 2 = 1, and plot C\ in Fig. |4] It is seen that C\ has its global minimum when p = 1, which 
corresponds to the case \a\ = \b\ as expected. Moreover, C\ , validating our conjecture, monotonically 
grows as p tends to its minimum and maximum values corresponding to the cases \a\ = Q, \b\ = P and 
\a\ = P, |6| = Q respectively. 

Note that in Fig. [4j the curves for Ho and Hi are mirrored versions of each other around p = 1 since 
we have = /?& in the example. From ( |40] ) we can say that the symmetry between Hq and Hi around 
p = 1 will exist whenever F = G, i.e., = fa. 
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Rician Fading: \a\ ^ \b\ 




Rician Fading: 
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Fig. 5. (a) The penalty term under Hi, i.e., Cf, for Rician channels with \a\ / \b\, as a function of the transmission levels a 
and 6. (b) The maximum contour is shown to exhibit the locus of the optimum signaling levels. C\ is color coded according to 
the color bar given next to the figure. 



E. Rician Fading 

In the Rician fading case, upon defining h^ — h^ — Hk from Section 



III-E 



we have cl 



\ah*+w*\ 2 



\bh*+w*\ 2 
^2 



when x\ — b. We 

will drop the subscript k in for convenience. We further define z a = ah^ + w% and = bh^ + 
that are zero-mean Gaussian variables with variances a\ and <j\, respectively. Then from Section 
similar to (f39l) we write the KL information as 



III-E 



Jf(tf) =/f(t*) + /^(E 



fk {j-k 



log 



1 + Gp-y 

1 + F-'Lp-^O 



log 



1 + FpeC- 



(41) 



where Ca 



lz a + (a-6) M | 2 



and ( b 



cf 

\z h +{b-a)^\ 2 
c^ 2 



cr 2 



. Now we will analyze the 



exponents ( a and 
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1) Case 1: \a\ ^ \b\ 

For ft = \ — -K > 0, i.e., \a\ > \b\, we can write ( a as 



Ca 



, 2 23ft{[i*(a-%]} |a-6| 2 |/i| 2 



(Jt 



. y-_ ,o 2V^3f?{z*(a-6)u} |a-6| 2 |u| 2 |a-6| 2 |u| 2 
|^a| 2 + — — + 1 1-^- + 



-( 



(a — 6)/i 



4 

a^ft 



4 

a^ft 



j2| mI 2 (^-i) 



+ 



<^ft 



z« (a ~ % 

+ 2 



<7, 



|a - 6| 2 |/i 



4 

2| ,,|2 



(42) 



(43) 



=tt a (l - p) + 



where we used o\k — 1 



|a — fo| 2 |/i| 



(|a|= 



-p 1 , a\n = p 1 (a 2 — a 2 ) while writing (43), and a 2 a n = p — 1 while 



(44) 



writing ( [44] ). Note that is a noncentral chi-squared random variable with two degrees of freedom and 
the noncentrality parameter X a = jtt^t^^a. Using \/^ft instead of it can be easily shown that 



(MH*I 2 )M' 
[44)) holds for ft < 0. Similarly one can obtain 



a = u b (i - p- 1 ) + 



a\ 2 \p\ 2 



a * at 



for ft ^ 0, i.e., \a\ ^ \b\, where = 



, (a-b)fi 



and u 6 - xi( A fe 



•fU \ A |a-6| 2 |/i| 2 crg 

Wltn A 6 - (| a |2_| 6 |2) 2(7 4. 



Accordingly, for the non-symmetric case where \a\ ^ \b\ from (41) we can write C\ as 



log 



l + Gp-ie " ( P )+ (ih 2 -m 2 K 

ufi o-m i 6 ~ a i 2 m 2 
1 + F-V _1 e J+ a»i a -M a W 



poo 

G / log 
Jo 



1 + G" V 



n(l o) I l a ~ b l 2 l^l 2 

^ P;+ ( | a |2_| 5 |2 )CT 2 



1 + 



^(l-P)+ (|a| 2_ |b| 2 )CT 2 



A a ?i ) du . 



(45) 



Similarly, we have Cq = — 1>\ — F 1 l2. 



The expression in ( |45j ) resembles the one in ( |40| ) for the Rayleigh fading case. And maximizing ( [45] ) 
analytically with respect to a and 6 seems even more intractable. Recall that in the Rayleigh fading case, 
the optimum signaling scheme was an OOK-like non-symmetric constellation, i.e., \a\ = P, |6| = Q or 
\a\ = Q, |6| = P. Considering the same power constraints we conjecture that the same signaling scheme, 
that maximizes the difference between the variances a 2 and a 2 , is optimum in this non-symmetric case. 
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Rician Fading: \a\ = \b\ 




Fig. 6. The penalty term C\ in Rician fading channels with \a\ — \b\, as a function of the mean and the variance of the channel 
gain, a = P = 10 and b = -P = -10. 



We provide a numerical example to illustrate the behavior of C\ as a function of a and b. Using the 
same values for a^, a^, a 2 , P 2 , Q 2 as in the Rayleigh fading case, and setting \i = 1 + j we plot 
in Fig. [5ja). The maximum contour of the three-dimensional surface in Fig. [5Ja), which corresponds to 
the potential optimum signaling level pairs, is clearly shown in Fig. (5^b). As seen in the figure C\ is 
maximized when \a\ = P, |6| = Q or |a| = Q, |6| = P validating our conjecture. 

2) Case 2: |a| = \b\ 



For k = 0, we have a 2 = a 2 , i.e., \a\ = \b\. Accordingly from (42) we write ( a = — s 2 — ^7 a , 
£b = — s 2 — ^-75 where similar to Section 



V-C 



we define s = 7a = »{5*(a - 6)//} and 76 



K{^(6 - a)fj}. Defining standard Gaussian random variables v a = \ a _^\ a J a and v h = | a _^| ao 7b» 



analogous to the AWGN case, we have ( a = — s 2 — \f2sv a and = —s 2 — V^su^. Therefore, from (41 ) 



C\ is given by (38). Accordingly, Lemma [2] applies here in the case of \a\ = \b\ under Rician channels. 
This case is analogous to the AWGN case since the received signal z\ has the same variance, but different 
means when x\ = a and x\ = b. Consequently, antipodal signaling is optimal. In Fig. [6j C\ is plotted 
as a function of the channel gain parameters |/x| 2 and g\. It is seen that C\ is increasing in |/x| 2 and 
decreasing in o\ when antipodal signaling is used, which corroborates Lemma [2] since s is increasing in 
|/x| 2 and decreasing in a\. 

In Fig. |7j the difference C\\ a \^\ h \ — ^\\ a \=^)\ * s plotted as a function of |/x| 2 and a\. For \a\ = \b\ 
antipodal signaling is employed; and for \a\ ^ \b\, OOK-like signaling is employed. It is seen that the 
OOK-like signaling is much better than antipodal signaling when the mean is low and the variance is 
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high. Although not visible in Fig. [7J antipodal signaling is only slightly better than OOK-like signaling 
when the mean is high and the variance is low. 

VI. Discussions 

Considering the unreliable detection of the sampling times under continuous channels, we should 
ideally integrate this uncertainty into the fusion rule of the FC. In other words, at the FC the LLR A^ of 
the received signal z\ should be computed at each time instant t if the sampling time of the k-th sensor 
cannot be reliably detected. In the LLR computations in (16) and (17) the prior probabilities Pi(x k = a) 



and Pi(x^ = b) are used. These probabilities are conditioned on the sampling time t\. Here, we need 
the unconditioned prior probabilities of the signal x\ which at each time t takes a value of a or b or 0, 
i.e, 

I a if L\ - L* i > A 
b if L k t -L% <-A (46) 
L n-1 
if L k t -L k tk €(-A,A). 

As before, the received signal at time t is z% = h\x\ + w\. Then, the LLR of z\ is given by 

~ xk = lQ (1 - p k )P k sA p(4\4 = a) + gfcP^nj = &) + (!- P h s,M4\4 - 0) 
* ° g a k ? k sfi p{z k \x k t = a) + (1 - a k )? k sfl p(z k \x k = b) + (1 - P k sfl )p(4\4 = 0) 

where is the probability that the FC receives a signal from sensor k under H^. Since the FC has 

no prior information on the sampling times of the sensors, this probability can be shown to be P r 1 fc1 , 

where E^[t^] is the average intersampling (communication) interval for sensor k under H^, i = 0, 1. For 
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instance, under AWGN channels [cf. ([16])] by defining c\ = \ z "~^ a \\ d\ = l*?-^ 2 , and g\ = ^ we 
have 

~ k= (1 - fo)Pjie-<* + foP*^ + (1 - P^)e-^ 

* og afc P^ oe - Ct fc + (i- afc )P^ e -* + (i-P^ ) e - 9 r 

Under fading channels A^ is computed similarly. Realizations of A^ of ([48]) and A^ of ( fT6] ) are shown in 
Fig.[8] where P — 10 is used. 







• 

1 


• 

< 

f 


1 





Fig. 8. Realizations of the LLRs and A J computed at the FC under reliable and unreliable detection of the sampling times, 
respectively. 

Note that in this case, {A^} are i.i.d. across time, and so are {A t } where X t = Y%=i *t is the § lobal 
LLR at time t. Hence, from Wald's identity, similar to Theorem 2 we can write Ei[7^ = El ^*~ 1 1 At ^ = 
| logo; | xherefore, we again need to maximize the KL information Ei[A+] (resp. — Eq[A+ 1) in order 

Ei [A t J 

to minimize the average delay Ei[7^ (resp. Eo[7^). However, analyzing this expectation is now much 
more involved than analyzing ( [37] ). On the other hand, in practice we need to ensure reliable detection of 
the sampling times by using high enough signaling levels P and Q. Then, the average delay performance 
of this unreliable detection scheme becomes identical to that of the reliable detection scheme analyzed 
in Section IVl 

As an alternative approach, in the unreliable detection case one can follow a two-step procedure 
to mimic the reliable detection case. Since it is known that most of the computed LLRs {A^} are 
uninformative that correspond to the no message case, a simple thresholding operation can be applied 
to update the LLR only when it is informative. The thresholding step is in fact a Neyman-Pearson test 
between the presence and absence of a message signal. The threshold can be adjusted to control the 
false alarm and misdetection probabilities. Setting the threshold appropriately we can obtain a negligible 
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false alarm probability, leaving us with the misdetection probability. Note that such a test would turn a 
continuous channel into a BEC with erasure probability, e^, equal to the misdetection probability. Recall 
that under BEC is the same as in the ideal channel case which corresponds to 



from Section 



III-A 



the reliable detection case here. Thus, if an LLR survives after thresholding, in the second step it is 



recomputed as in the channel-aware fusion rules obtained in Sections |III-C III-D and |III-E| Moreover, 



the KL information in (37), (39) and (41 ) will only be scaled by (1 — as shown in (35). Consequently, 
the results obtained in Sections |V-C[ |V-D[ and |V-E| are also valid in this approach to the unreliable 
detection case. 



VII. Simulation Results 

In this section, we provide simulation results to illustrate the performance of the channel-aware dis- 
tributed detection schemes based on level-triggered sampling. Assume there are two sensors collaborating 
with an FC. At each time t, each sensor makes a local observation y\ = s + k = 1,2, with 
v\ ~ A/" c (0, 1), 5 = 1 under Hi, and 5 = under Ho. Hence, the LLR, l\, of y\ is computed as 

i\ = m{ y \} - 1. 

Each sensor on average samples and transmits 1 bit to the FC once every four samples they observe, 
i.e., T = 4. And the local threshold A is determined to meet this average sampling interval. It has been 
shown in [12, Section IV-A] that one can use the equation A tanh(y ) = T^p- to find A. Then, using the 
A value the local error probabilities and are computed offline for each sensor. From Lemma [T] and 
Proposition [2J we have | log a\ — 6 < A, A < \ loga\ and | log f3\ — 6 < B, B < | log f3\ where 6 is the 
largest received LLR magnitude. Hence, we can set the global thresholds A and B to their upper bounds 
|loga| and |log/3| respectively to meet the constraints Po(5f = Hi) < a and Pi(5^ = Hq) < (3. To 
achieve the equalities, A (resp. B) should be found via simulations within the interval [| log a\ — 6, | log a\] 
(resp. [| log/3| - 9, | log Note also that A,A~\ loga\ and B,B ~ \ log/3| as a, (3 -> 0. 

We compare our channel-aware designs with the conventional approach where the FC first decides on 
the received data bit and then uses it to update the test statistic. Under BEC and BSC, since the received 
signal is already binary, in the conventional approach the FC simply treats the channel as ideal. On the 
other hand, under AWGN, Rayleigh fading, and Rician fading channels, in the conventional approach the 
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Fig. 9. Error performance comparison between the proposed channel-aware approach and the conventional methods. 



FC first demodulates the received bit by using the following maximum-likelihood (ML) decision rules 

AWGN : max $l{(z*)* h k x*} , 

x*e{a,b} 



exp(- 



l^l 2 



Rayleigh : max 



ie{a,b} \x k n \ 2 G 2 + G 2 

exp (_ t \<^M 2 



Rician : max 



x*e{a,b} \Xn\ 2 crl + (J 2 



Then, it updates the test statistic either by treating the channel as ideal, i.e., using ( [10] ), (note that this 
approach cannot guarantee to satisfy the target error probabilities since its performance highly depends 
on the performance of the receiver block) or more reasonably by treating the channel as a BSC assuming 
the error rate of the ML receiver is known, i.e., using ( [T2] ). 

A. Error Performance 

Firstly, we demonstrate that the channel-aware designs presented in this paper can meet the target error 
performance in noisy channels.. We set a = (3 = 10 -2 , A = |loga|, B = \ \og/3\, E[\h^\ 2 } = 1 (i.e., 

\hk\ 2 = l,Vfc for AWGN channels; a\ k = l,Vfc for Rayleigh fading channels; and o\ k = 0.5, |^/e| 2 = 

a EN/i fc l 2 l I I 

0.5, V/c for Rician fading channels). We define SNR= u cr | J . As an example, in Fig. 9 we show the 

actual error performances in Rician fading channels for both the proposed channel-aware approach and 

the conventional methods. The error performance under ideal channels is also shown. It is seen that 

the channel-aware method and the conventional method treating the channel as BSC can always meet 

the specified error bounds under different channel conditions. In fact, they achieve even smaller error 
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Fig. 10. The average decision delay vs. the achieved error rates in Rician fading channels. 



probabilities under bad channel conditions, i.e., low SNR or high e, since they update the test statistics 
even more cautiously with smaller increments. However, the conventional approach that treats the channel 
as ideal is vulnerable to noisy channels. Its error performances are far away from the bounds especially 
at low SNR. Similar results are observed for the other noisy channel types. 



B. Detection Delay Performance 

We now show the actual decision delay performance of the proposed channel-aware approach as a 
function of the achieved error rates. In this subsection, different from the previous one we do not determine 
the thresholds A and B for the given error probability bounds. But rather, for a specific set of A and B 
values we simulate the schemes to obtain their operating characteristics, i.e., the average decision delay 
and error probabilities. For fair comparisons we set the channel error probabilities of discrete channels, 



i.e., BEC and BSC, to e& = 0.1, V/c; and set SNR= OdB for all continuous channel types. Fig. 10 



compares the channel-aware scheme to the two conventional schemes assuming ideal channels and BSC, 
respectively, after bit recovery under Rician fading channels. The average decision delay of the channel- 
aware scheme is significantly lower than those of the conventional schemes. Moreover, the channel-aware 
scheme provides more achievable error probabilities than the conventional schemes, since the step sizes 
are much finer for the channel-aware scheme. The discrete nature of the average decision delay curve 
is due to having finite number of values to update the test statistic at the FC. This phenomenon was 



explained in detail in 1 12 1. The conventional schemes have only two possible update values that are given 



in (fTOJ) and (12), whereas the channel-aware scheme uses a continuum of values to update its LLR sum 
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Fig. 11. The average decision delay as a function of the achieved error rate under different discrete channels. 
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Fig. 12. The average decision delay as a function of the achieved error rate under different continuous channels. 



as given in ( flT] ). Similar results can be obtained for the other channel types. 

Next, we compare the decision delay performances of the channel-aware schemes under different 



channels. Fig. 11 and Fig. 12 show the results for the discrete channels and the continuous channels, 



respectively. It is seen that BEC has a superior performance than BSC. Note from Fig. [TTJthat the step 
sizes are large and the number of achievable error probabilities is the same for all three cases since there 



are only two LLR update values [cf. ([10|)-(|13])]. For each continuous channel type, the corresponding 
signaling scheme discussed in Sections V.C-E is used in the simulations. As expected the AWGN channel 
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case has a much better performance than the fading channel cases since under AWGN, the channels are 
deterministic and known to the FC, whereas in fading cases the channels are random and only the 
statistics are known to the FC. Moreover, under Rayleigh fading, channels have zero mean increasing the 
uncertainty, hence this case has the worst performance among the continuous channel types. Finally we 



consider the fusion rule ([48]) that takes into account the unreliable detection of the sampling times. We use 
SNR= dB for all channels; P = 20 under AWGN; P = Q = 20 under Rician; and P = 100, Q = 20 



under Rayleigh. In Fig. [12} it is seen that the channel aware scheme has almost identical performances 
in the reliable and unreliable detection cases under all continuous channels. 



VIII. Conclusions 

We have developed and analyzed channel-aware distributed detection schemes based on level-triggered 
sampling. The sensors form local log-likelihood ratios (LLRs) based on their observations and sample 
their LLRs using the level-triggered sampling. Upon sampling each sensor sends a single bit to the 
fusion center (FC). The FC is equipped with the local error rates of all sensors and the statistics of the 
channels from all sensors. Upon receiving the bits from the sensors, the FC updates the global LLR and 
performs an SPRT. The fusion rules under different channel types are given. We have further provided 
an asymptotic analysis on the average decision delay for the proposed channel-aware scheme. We have 
shown that the asymptotic decision delay is characterized by a KL information number, whose expressions 
under different channel types have been derived. Based on the delay analysis, we have also identified 
appropriate signaling schemes under different channels for the sensors to transmit the 1-bit information. 
Numerical examples have demonstrated the advantages of the proposed channel-aware approach over the 
conventional methods. 
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Appendix: Proof of Lemma 2 
We will present the proof under Hi, and the proof under Hq follows similarly. We need to find the 



dC k 

condition for > 0. From pS}, we have 



dC\ 
ds 



= E 



(V2v - 2s)e- s2+ ^ SI/ {G - F" 1 ) 



+ 



(-V2u - 2s)e- s2 -^ s,/ (l - FG) 



(1 + Ge- s2 +^)(l + F-ie-*+^ sv ) (1 + G- l e~ s2 -^ sv ){l + Fe~ s2 -^ s 

(V2v - 2s)e- s2 +^ su 



= (FG 



F(l + Ge" s2 +^)(1 + F- 1 e~ s2 +^ sv ) 
(-V2u - 2s)e- s2 -^ su 



FG — 1 



(1 + G~ l e- s2 -^ sv ){\ + Fe~ 
( v / 2^-2s) e -(^- s ) 2 



-V2s 



+ 



e 2 



dv 

(V2v + 2s)e-(^ +s 



F(l + Ge" s2 +v / 2^)(l + F- 1 e~ s2 +^) (1 + G" 1 e- s2 -v / 2^)(l + Fe~ s2 -^ S1/ 



dv 
(49) 



If we choose A > 0, then we will have + Pk < 1 which in turn yields FG > 1, but here we 
will reasonably assume that ak,(3k < 0.5 and accordingly F, G > 1. Therefore, it is clear that in order 
to conclude the proof we need to show that the integral in (49) is positive. Define r± = — s and 
V2 = ^= + s, then we need to show the following inequality 

2rie~ r i 2rie~ r i 



F(l + Ge s2 + 2s) -i)(l + F- 1 e« a + 2ar F(l + Ge s2 - 2 ^)(1 + F-V 2 " 2 ^; 

2r2e _r 2 2r2e _r2 



dri > 



(1 + G-V 2 - 2 ^)(l + Fe s2 - 2sr ?) (1 + G-V 2 + 2s ^)(l + Fe s2 + 2s ^ 
Note that ( |50| ) holds if the following inequality holds, 

1 



dr 2 . (50) 



1 



F(l + Ge g2 + 2sr )(l + F-V 2 + 2 ^) F(l + Ge s2 ~ 2sr )(l + F-V 2 " 2 ^) 

1 

+ 



> 



1 



(l + G-V 2 - 2s) -)(l + Fe s2 - 2s) -) (l + G- 1 e s2 + 2s) ')(l + J Fe s2 + 2 ^) 
Thus, after rearranging terms it is sufficient to show that 

(G _1 F - G)e s22+4s) - + (FG + 1)(G _1 - l)e s2+2sr + (1 - F) 



(51) 



(1 + Ge s2 + 2 ^)(1 + F"V 2 + 2s) -)(l + G" 1 e s2 + 2 ^)(1 + Fe* 2 + 2s) -) > 

(G~ l F - G)e 2s2 ~ 4sr + (FG + 1)(G~ 1 - l)e s2 ~ 2sr + (1 - F) 
(1 + Ge* 2 - 2sr )(l + F-V 2 - 2 ^)(l + G- 1 e s2 - 2 ^)(l + Fe s2 " 2 ^) ' ( 2 
Define p = s 2 + 2sr, g = s 2 - 2sr, C x = G - g, G 2 = (FG + 1)(1 - £), G 3 = F + F" 1 + G + G"\ 

G4 = 2 + FG + g + ji + jyj. Multiplying both sides with —1, and rearranging terms we can rewrite 
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( [52] ) as follows 

(de 2p + C 2 e p + F- l)(e 4q + C 3 e 3q + C 4 e 2q + C 3 e q + 1) < 

(de 2q + C 2 e* + F - l)(e 4p + C 3 e 3 ^ + C 4 e 2p + C 3 e^ + 1). (53) 
After some manipulations, we obtain the following inequality 

CiC 3 (e 2 ^ - e p+2q ) + d(e 2p - e 2q ) + C 2 (e p - e q ) < 

d(e 4p+2q - e 2p+4q ) + C 2 (e 4p+q - e p+4q ) + 
(F - l)(e 4p - e 4q ) + dC 3 (e 3p+2q - e 2p+3q ) + 
C 2 C 3 (e 3p+q - e p+3q ) + C 3 (F - l)(e 3p - e 3q )+ 

C 2 C 4 (e 2p+q - e p+2q ) + C A (F - l)(e 2p - e 2q ) + C 3 (F - l)(e p - e q ). (54) 

Finally, noting that p > q (since s > 0, r > 0) if we cancel the common term e p — e q , then the inequality 
that we need to verify becomes the following 

C 1 C 3 e p+q + C 1 (e p + e q ) + C 2 < 

dC 3 e 2p+2q + de 2p+2q (e p + e q ) + C 2 e p+q (e 2p + e p+q + e 2q ) + 
(F - l){e 2p + e 2q )(e p + e q ) + C 2 C 3 e p+q (e p + e^) + 
C 3 (F - l)(e 2p + e p+q + e 2q + 1) + C 2 C 4 tf+ q + C 4 (F - l)(e^ + e 9 ). (55) 



Now assuming that C\ > 0, i.e., G 2 > F, it is straightforward to verify the inequality in (55) 



Since we have p + q = s 2 > 0, we also have e p+q < (e p+q ) 2 , e p + e q < e 2p+2q (e p + e q ), and 



e pJrq {e 2p + e p+q + e 2q ) > 1. Note also that the last five terms on the right hand side of (55) are 
positive due to having F > 1, C\ > 0, C 2 > 0, C 3 > 0, C4 > 0. Hence, is increasing in s for all k 
when G 2 > F. Similarly we can show that Cq is increasing in s for all k when F 2 > G. 
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